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Grade Equivalents; We Report Them, You Should Too 



Grade equivalents (GEs) are better than percentiles for answering several 
key questions about student achievement on standardized tests. Raw scores, 
normal curve equivalents (NCEs), and standard scores are dysfunctional when 
trying to explain achievement test scores to most audiences. Now, as a 
reader, you must be looking for some hard data to back up these state- 
ments. This paper contains some data, but is more a position paper on the 
best statistic for reporting standardized achievement test results based 
upon the authors' experiences in managing achievement testing in an urban 
public school system. 

GEs have long been the subject of much debate within the educational and 
psychological arenas. Many teachers, principals, and counselors have shown 
an acquired distaste for them— perhaps in some cases inspired by a 
provincial university professor without adequate experience in the public 
school realm. Sometimes the opposition has come from our own ranks, as 
evidenced by Attachment A, a nine-year-old expose from our own testing 
office. While canon ■isconceptions and Misuses of GEs have been discussed 
and docuaented, too poorly dooaented are the questions for which GEs are 
the Mst appropriate statistic available. 

We intend to clarify some issues in the lingering debate over the use of 
GEs. Moreover, the relative usefulness of different statistics will be 
examined in relation to their abilities to answer essential questions for 
assessing student learning. Our objective is to convince practitioners 
that SEs have a unique and important contribution to make to educational 
evaluation and the reporting of student achievement progress. ' 

Let us dispense with the utility of raw scores, standard scores, and normal 
curve equivalents as statistics for the common folk (who make up the 
majority of our audience for test results). Uhen we have to spend More 
tine defining and explaining the limited infonaation these statistics 
carry, they becone dysfunctional. More importantly, they do not 
communicate with most lay audiences as well as do GEs or percentiles. To 
be fair, the testing practitioner will find raw scores and NCEs but 
especially standard scores to be more useful and appropriate for 
calculating gains and averages and for tests of significance; however, 
converting thero to either GEs or percentiles is practical before 
communicating the results. 

This leaves us with GEs and percentiles as competitors for our endorsement 
as the best statistic for reporting to general audiences. In this paper, 
we present the position that neither is omniscient, but each answers 
specific questions better than the other. 



85,47 



Our Perspective 

We shall begin with some background and content. Austin is a central Texas 
urban community of approximately 350,000 - 450>000 people composed of 
heterogeneous groups having varied interests. The University of Texas, the 
State governmental offices and political community, high-tech industry, and 
many other facets make Austin and the students in the Austin Independent 
School District (AISD) a sophisticated blend of sun-belt diversity. 
Citizens of Austin take a healthy interest in the affairs of AISD and the 
progress of its students. 

The School District itself has long been known for innovative programs and 
strategies and, in many areas, has become a standard for emulation. 
Students consistently achieve above the national average at grades 1-12 in 
all achievement areas. Compared to urban districts nationwide, the average 
AISD student in grades 1-8 achieves higher in all areas than three fourths 
of the students tested across the country. The average AISD student at 
grades 9-12 achieves higher than two thirds of the students in urban 
districts nationwide. On the other hand, one third of AISD's students are 
from low-income families and almost half are either Hispanic or Black. 
What this shows is that we have both a sophisticated and varied clientele 
to which we report achievement test results. 

The Office of Research and Evaluation (ORE) is responsible for all types of 
systemwide testing. Students in grades K-8 take the Iowa Tests of Basic 
Skills (ITBS), while high school students (grades 9-12) take the Tests of 
Achievement and Proficiency (TAP). We also coordinate the administration 
of the Texas Educational Assessment of Minimum Skills (TEAMS), a 
State-mandated testing battery administered in Texas to students in grades 
1, 3, 5, 7, 9, and 11-12* 

ORE has been reporting gEs in addition to percentiles to central office and 
campus staff for 6 years. Since 1984, both GE and percentile scores have 
been sent home to parents of students in grades 7-12. (Parents of 
elementary students receive percentile scores only; however, GEs may be 
provided in conferences with elementary teachers.) 

One of our primary responsibilities is doing as much as possible to 
communicate test (and other evaluation) data to all groups (e.g.,^e 
School Board, teachers, parents) who may utilize it. Moreover, our 
mission is to educate effectively these groups in the understanding, 
interpretation, ana use of test scores and related information. Only by 
comprehending all test data available can we and others fully (a:^d most 
cost-effectively) evaluate the state of learning and the progress made by 
the individual student, the class, the school, or the District. 

As educators, we need to identify specific questions which must be answered 
when determining progress. What issues are most critical in which 
situations? What score would best provide the insight needed for an 
effective evaluation? These and others are the questions which, 
ultimately, we seek to answer. 

2 4 
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Our data set and the resources drawn together for this paper are derived 
from our years of coordinating and administering standardized tests, and 
from the experience gained in executing and molding a public school 
testing program. In addition, in November, 1985, we surveyed elementary 
school teachers about their attitudes towards the accuracy and reporting 
of GEs. 

Types of Test Scores 

Many types of data are available for testing practitioners to use when 
reporting results to various audiences. We must first examine some 
characteristics of each, citing strengths and weaknesses, both from a 
practical and a technical standpoint. 

Percentile rank scores are the most widely reported testing statistic. 
They offer a comparison of each student to a norm group, in this case a 
representative sampling of thousands of students nationwide. Percentiles 
can be readily understood at fece value by school staff as well as the 
public; however, they are frequently misunderstood and misused when 
employed as the sole index by which to examine progress over a period of 
time. From a technical standpoint, percentile rank scales are frequency 
distributions having significant inequality of intervals among points on 
the scale, especially at the extremes of the distribution. Conseque?itly, 
percentile rank scores show the student's relative position compared to 
the norm group, while offering limited information on amounts of learning 
differences between pre- and posttest data and other aspects. 

Standard scores (SSs), which include stanines and NCEs, offer a different 
perspective. Standard scores express the student's distance from the 
mean in terms of the distribution's standard deviation. The process used 
to derive standard scores yields a scale with a distribution and 
properties corresponding to those of the raw score distribution scale. 
The result is that, without any distortion of results, computations and 
other manipulations that can be executed using the original raw scores 
can In most cases also be done using standard scores. Additionally, 
standard scores are useful to make comparisons between results from 
different test instruments, providing a common basis and scale for 
evaluation. 

Reporting standard scores in most cases would be meaningless to the 
public. The numerical values used and their relationships would be 
confusing and inappropriate for imparting information to the public. 

Normal curve equivalents (NCEs) are similar to a stanine scale with the 
stanines split into ten parts each. NCEs (and stanines) are useful 
because they provide equal-interval scales that may be averaged; however, 
they also have drawbacks limiting their use in reporting results in a 
public school setting. Namely, they are often interpreted as if they are 
percentiles. 



85.47 



Grade equivalents (GEs) offer the perspective of where the student falls 
compared to a developmental continuum constructed based on the norming 
sample. For example, a GE of 6.7 means the student made the same raw 
score as would the average stwdent in the seventh month of grade six. 

Historically, GEs have been somewhat misunderstood by school staff, 
parents, and other groups. For example, a parent might mistakenly think 
that his child in fourtf^ grade who scored a GE of 6.7 should actually be 
enrolled in the sixth grade. 

From a technical consideration, GE scales have varying degrees of useful- 
ness. Although the ITBS GE scales are derived to be equal interval 
scales, providing users the luxury of having flexibility in utilizing the 
data, this is not true of all GE scales. When deciding whether to report 
GEs, a careful scrutiny of the norms must be conducted. 

Frankly, AISO would not be reporting GEs if we had not investigated and 
found the ITBS GEs to be reasonably equal interval and to meet certain 
simple standards (i.e., the GE associated with the 50th percentile must 
be the grade and month representing the critical norming date; a 50th 
percentile student must gain 1.0 GE a year to maintain the 50th 
percentile; students above the 50th percentile must gain more than 1.0 GE 
in a year to maintain the same percentile; students below the 50th 
percentile must gain less than 1.0 GE in a year to maintain the same 
percentile.) 

Problems with Grade Equivalents and Percentiles 

Unfortunately, our chosen champions among these statistics are both 
deeply flawed. 

A brief overview of the problems most often cited in the use of GEs is 
provided in Figure 1. Also provided is an overview of the similar 
problems with percentiles. Because both can be (and are) misinterpreted, 
a logical conclusion is that the misinterpretation of GEs must be more 
problematic for educators and influences their discrepant acceptance of 
GFs and percentiles. This may be plausible considering the misinter- 
pretation of GEs can more easily be used by parents to argue for double 
promotions to higher grades, or to express concern over how far behind a 
student is. For teachers, there may be the frustration of wanting from 
their investment in standardized testing a GE that perfectly diagnoses 
the best instructional level for students, but not finding that perfect 
diagnosis. 

He believe that the shortenings of percentiles are merely less well 
knom than are those of GEs. In our experience, percentiles are 
misinterpreted and misunderstood at least as often as are GEs. 
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Figure 1. PROBLEMS WITH GRADE EQUIVALENTS AND PERCENTILES. 



1. 



2. 



3. 



Grade Equivalents 

Cannot be averaged 
(Unless developed as an 
equal -Interval scale) 

Often misinterpreted, 
for example: 

a. Fifth grader scoring 
at 6.5 could succeed 
in sixth grade. 

b. Fifth grader scoring 
at 6.5 should be in 
a sixth-grade text. 

c. A GE of 18.5 is 
more than just a 
theoretical 
projection. 



Host GE score points 
are either interpolated 
or extrapolated rather 
than empirically derived. 



Pertrentlles 
1. Cannot be averaged 



2. Often misinterpreted, 
for example: 

a. Percentile means the percent- 
age of items answered correctly. 

b. A 69th percentile is a failing 
grade. 

c. Student at 50th percentile 
two years in a row did not 
gain (learn). 

d. Student at 30th percentile 
two y&ars in a row gained a 
year, kept place with 
average student. 

e. Student going from i5th to 
16th percentile is catc^iing 
up with average student. 

f. Student at 65th percentile in 
in both reading and math is 
equally above grade level in 
both areas. 



When Should Achievement Test Results Be Reported in Grade Equivalents o r 
Percentiles? 

This is the key question in this paper. By examining the questions that 
parents and educators ask about student achievement, we can determine 
which can be answered better by either GEs or percentiles. Figure 2 
summarizes the seven critical questions most frequently encountered in 
our achievement testing experience. 
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After much agonizing over how these seven questions could be conceptually 
related to each other, we were able to locate each question along two 
diniensions~"cQniparison standard" and "time." The five comparison 
standards include: 

. Grade Level— that almost mystical level that indicates average 
or expected achievement, as in being "on grade level"; 

. Peers— other students in the same grade level (or age. level); 

. All Students, All Grades— students from prekindergarten 
through grade 12 who take standardized tests; 

• Self— the student's own past achievement; and 

. Students at the Same Achievement Level — students who made the 
same score on past achievement tests. 

The two time periods include: 

. Statvis — achievement to date; and 

. Gain— achievement improvement within the last measurement period 
between a pre- and posttest. 

A brief discussion of the rationale for selecting either GEs or 
percentiles as more appropriate for each of the seven questions follows: 

1. Is the student above, at, or below grade level? (Comparison 
Standard: grade level; Time: status) 

Both GEs an:! percentiles answer this question. Grade level can 
be defined as a point representing the exact average score at the 
critical norming date (i.e., 50th percentile or a GE equivalent 
to the year and month of testing) or as a range of scores around 
that point. 

2. How does the student rank among others in the same grade? 
(Comparison Standard: peers; Time: status) 

Percentiles provide a rank, GEs do not. 

3. How far ahead/behind is the student? (Comparison Standard: 
peers; Time: status) 

GEs provide an indication of how far (in months and years) a 
student differs from the average. Percentiles tell how far from 
average a student is in terms of a percentage of students, but 
this does not communicate in terms that answer the question. 

4. Did the student learn more, the same, or less than the average 
student? (Comparison Standard: peers; Time: gain) 
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6Es answer this question because by definition the average 
student gains one year (1.0 GE) in a year's time. Percentiles 
answer this question only for certain students, i.e., those who 
score at the 50th percentile on the pretest (a score below 50 on 
the posttest indicates a gain les* than the average student, a 
score above 50 indicates a greater gain* and a score at 50 
indicates an equal gain), these wfic score below the 50th 
percentile on the pretest and the same or lower on the posttest 
(this represents less than the average student's gain), and those 
who score above the 50th percentile on the pretest and the same 
or higher on the posttest (this represents greater than the 
average student 'f; gain). 

5. In which grade would the average student score the same as this 
student: (Comparison Standarfi: ail students, all grades; Time: 
states) 

This is the definitioTJ of GE. 

6. Did the student learn at a rate that is faster, the same, or 
slower than in the past? (Comparison Standard: self; Time: gain) 

Percentiles easily answer this question because a student who 
maintains the same percentile rank on the pre- and posttest has 
learned at the same rate as in the past. A higher posttest 
percentile indicates a faster rate than in the past; a lower one, 
a slower rate. 

7. Did the student learn at a rate that is faster, the same, or 
slower than other students who began at the same level? 
(Comparison Standard: students at the same achievement level; 
Time: gain) 

Percentiles easily answer this question because a student who 
maintains the same percentile rank on the pre- and posttest has 
learned at the same rate as other students who were at the same 
level. A higher posttest percentile indicates a faster rate than 
in the past; a lower one, a slower rate. 
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Figure 2. WHEN SHOULD ACHIEVEMENT TEST RESULTS BE REPORTED IN GRADE 
EQUIVALENTS OR PERCENTILES? 



COMPARISON 
STANDARD 



TIME 



Status: 
Achievement 
To Date 



Gain: 
Within Last 
Measurement Period 



Grade Level 



Peers 



%ile or GE: 



Is student above, 
at, or below 
grade level? 



All 
All 



Students, 
Grades 



Self 



%ile: How does student 
rank among others 
in the same grade? 

GE: How far ahead/behind 
is this student? 

GE: In which grade would 
the average student 
score the same as this 
student? 

N/A 



Students at 
the Same 
Achievement 
Level 



N/A 



N/A 



GE: Did student learn 
more, the same, or 
less than the 
average student? 



N/A 



%ile: Did student learn 
at a rate that is 
faster, the same, 
or slower than in 
the past? 

%ile: Did student learn 
at a rate that is 
faster, the same 
or slower than 
other students 
who began at the 
same level? 



Implementation 

Having identified the benefits of GEs, the reporting of them in our District 
was the logical and appropriate course to follow. However, because the 
statistic is one so frequently misunderstood and misused, a massive training 
effort had to be undertaken throughout the District. Every segment of 
potential users, including the School Board, central administration and campus 
staff, and the students and parents, had to be informed in varying degrees 
about GEs. 
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An assortment of materials was created, including a packet of handouts 
for each teacher. In addition, ORE produced a videotape dealing with 
understanding and using 6Es that was broadcast on AISD's cable tele- 
vision channel for viewing during faculty meetings and at other times. 
Attachment B contains samples of some materials produced for training 
campus and central office staff about GEs. 

Conversations, notes, and comments over the past few .years indicate that 
misconceptions and misinterpretations about test scores are persistent 
and resistant to training. 

Observations and Results 

GEs have now become a commonly used statistic in AISD. Teachers, 
counselors, and principals appear to have begun to grasp the concepts of 
GEs and, in discussions with ORE staff, seem to prefer them in many cases 
(e.g., when discussing issues dealing with pretest and posttest data). 
However, we wanted some actual data on the acceptance level of GEs. 

Therefore, in November, 1985, a sample of AISD elementary teachers was 
surveyed about their attitudes towards GEs. For two of the questions. 
Figure 3 provides the percentages of teachers answering with each 
response. 

Of the teachers who responded to Question 1, 69.4% expressed positive 
support for the reporting of GEs in addition to reporting percentiles. 
However, this support is confounded by the responses to Question 2, which 
concerns the misleading properties of GEs. While 31.33; of the teachers 
responding thought that GEs are not misleading, 39.13; said that GEs are 
in fact misleading indicators of student achievement levels. 



Figure 3. RESULTS OF GRADE EQUIVALENT (GE) SURVEY OF ELEMENTARY SCHOOL 
TEACHERS, NOVEMBER, 1985. 



Question: 


1. Having GE scores 
reported in 
addition to per- 
centiles is helpful 
to me. 


2. GE scores are mis- 
leading indicators 
of student achieve- 
ment levels. 


Responses: 
Strongly Agree 
Agree 
Neutral 
Disagree 
Strongly Disagree 


30.6X 
38.8% 
20.9% 
7.5% 
ZA% 


12.0% 
27.1% 
29.6% 
24.8% 
6.5% 


lotal Number of 
Responses 


425 


443 


lotal Number of 
Survej^ Sent 


563 


575 


Percentage of Teach- 
ers Responding 


75.5S 


77.0X 
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An additional survey item asked fifth-grade teachers to identify the 
reading instructional level of a particular student In their class. (One 
student in each class was selected by us. The total group of students 
selected spanned the range of low to high achievement, with most students 
chosen being around the average achievement level.) The response options 
provided on the survey item covered the achievement range from 2.3 to 
8.3, with responses available in-between at increments of .5 GE. This 
gave teachers the option of choosing "5.3" (on grade level for the date 
of the survey) or values lower or higher than 5.3 in increments of ,5 
year. Teachers' responses were compared to actual ITBS Reading Total 
scores from the previous April's testing (after adding .5 GE to account 
for the difference between date of testing and date of survey). Figure 4 
contains the survey item and the results. 

Clearly, the ITBS GE and the teachers' estimates of the students' actual 
reeding levels are close. Inspection of Figure 5 shows that the ITBS GE 
may overestimate actual classroom performance/instructional levels for 
the higher scoring students; however, a larger sample with more precise 
measurement is needed to verify this. 

The bottom line appears to be that GEs can provide teachers a gross 
estimate for a student's instructional level, and the majority of 
teachers find having both GEs and percentiles to be helpful. However, 
there is obviously still a high level of skepticism on the part of our 
teachers as to the accuracy of GEs as indicators of student achievement 
levels. 
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Figure 4: COMPARISON OF FIFTH-GRADE TEACHERS' ESTIMATES OF STUDENTS' 
READING LEVELS AND LATEST ITBS SCORE. 



I TBS Achievement 
Level Minus 
Teacher's Estimate 


Number 
of 
Teachers 


Percentaop 

of 
Teachprs 


+3.0 


0 


0.0% 

V/ • \J to 


+2.5 


1 


1.7% 

^ • t to 


+2.0 


5 


8.6% 


+1.5 


3 


5.2% 


+1.0 


8 




+0.b 


11 


19.0% 


0.0 


11 


19.0% 


-0.5 


16 


27.6% 


-1.0 


2 


3.4% 


-1.5 


0 


0.0% 


-2.0 


1 


1.7% 


-2.5 


0 


0.0% 


-3.0 


0 


0.0% 




N = 58 





Teacher's estimate and 
actual ITBS GE were both 
expressed to the nearest 
.5 GE within a range 
from 2.3 to 8.3. 



Correl ati on Coef f i cient 
Teacher's Estimate and Actual 

ITBS GE = .760 
Teacher's Estimate and Rounded 

ITBS GE = .738 
(P < .COOl for both) 



QUESTION: Which grade equivalent listed below is closest to the reading 
instructional level of ? 



A. 


2.3 


B. 


2.8 


C. 


3.3 


D. 


3.8 


E. 


4.3 


F. 


4.8 


G. 


5th 


H. 


5.8 


I. 


6.3 


J. 


6.8 


K. 


7.3 


L. 


7.8 


M. 


8.3 
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Teacher's 
Response 



14 



1.3 



7.1 



7.3 



6.a 



i.3 



5.a 



S.3 



4.1 



3.1 



3.3 



2.1 



/ 



C B k 



/ 



S 3.3 3.a 4.3 U S.3 5.6 6.3 i.a 

ITBS GE 

Figure 5. PLOT OF TEACHERS' RESPONSES TO SORVEY OUESTION. 
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Conclusio n 

The authors are not taking the position that GEs are the best statistic 
(they are not in many circumstances), nor the position that GEs will be 
used without misinterpretation (they seldom have been), but the position 
that GEs provide a perspective that is missing from other statistics and 
is necessary for complete understanding of a student's achievement level. 

Some data have been presented to show that the majority of teachers 
probably find having GEs reported to be useful and to show that GEs can 
predict a student's approximate instructional level in reading. 

Clearly GEs have their technical shortcomings, but equally as clear is 
the conclusion that GEs help us answer questions that other statistics 
respond to inadequately. 

Challenge 

We challenge test publishers to develop grade equivalent scales that are 
empirically derived with a minimum of interpolation and extrapolation. 
We challenge testing practitioners to recognize the usefulness of GEs and 
to use them wisely and cautiously to answer the questions for which 
percentiles are inadequate. 
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ATTACKMFNT A 

EARLY ORE PUBLICATION 

CONCERNING GEs 



THE TRUTH ABOUT 
GRADE EQUIVALENT SCORES ! 



OFFICE OF PESEARCH AND EVALUATION 
Austin Independent School District 



Pub. Ko. 76-18 



January, 1977 



Grade equivalent scores have a nuaber of real technical drawbacks which lead to practical 
problens when thsy are used to interpret a student's performance on a standardized test. 
The following are some fables about grade equivalent scores that O.R.E. has compiled in 
the tradition of Aesop* 

Q Johnnie, a Sth grade student, received a 4.0 
grade equivalent score in both math and reading. 
Conclusion: Johnnie's teacher does not nned to 
concentrate on one subject more than the other. 

What Ajb tofiong uUth thJU InteApfLetaution o{i gnade. cxiuival^nt 6C0A.t6? 




As this table shows, even though Johnnie 
scored the same grade equivalent in 
reading and math, his percentile in a 
national sample of Sth grade students 
for these subjects indicates a lower 
rank for math than for reading. 



%(le8 for GE of 4.0 


Reading 


Math 


20 


13 



MORAL : 



Comparing a student's girade equivalent scores in two different 
content areas is inappropriate and can lead to incorrect 
instructional decisions* 



Emma, a 4th grade student, scored a 7*5 grade equivalent on the 
reading section of a standardized test. Obviously, Emma is a 
much more advanced student than most 4th graders and should 
possibly be provided with enrichment materials in reading. 
On the basis of this test score, her teacher, Ms* Wagner, decides that thf appropriate 
level of reading enrichment materials that would best suit Emms*s capability is 7th 
grade reading materials* 

U thU tut oi gKodt tqwivalavU cofuitctt 

A grade equivalent score of 7.5 on this test indicate^ that Emma reads the material! on . 
this particular test which was designed for 4th grade students as well as would the 
average 7th grader* Ve do not know how well Emma might read materials on a test designed 
for 7th graders* Accordingly, there is no bssis for assuming that Emma can adequately 
read and comprehend 7th grade materials* 

M|^n A V » Grade equivalent scores should not be the sole basis for 
^ • deciding which instructional materials a student should use. 
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On a standardized test for the 4th 
grade level, gr^de equivalent 
scores In reading of non-Title I 
students were higher by 2.3 years 
than those of Title I atudeats. For 5th graders, the 
difference was 2.6; and In the 6th grade, the difference 
In average scores was 3.0 years (Figure 1). The conclu- 
sion drawn from this Is that "Title I students are losing 
ground every year." 



Percentile 
Scores 



90- 



70-. 



l6 tkU conibuion coAAtctf 




5 

Grade 
FIGURE I 



Non-Title I 




Title I 



4- 



50-- 



Hot entirely. The 
problem Is that the 
grade equivalent 
scores do not provide IOt 
all nf the Infonutlon 
that Is needed. There 
Is evidently a progressive 
gap In absolute achleveaent 
between the group of Title I 
students and Che group of non-Title I students. But, as 
Figure 2 Illustrated, these Title I students are not 
losing ground In terns of their relative ranking with 
other students. Actually, the Title I student average 
for each of the Grades A, 5, and 6 corresponds to the 
saree peicentl.le score (2SZlle) for each grade. 



1— 

5 

Grade 
FIGURE 



Comparing grade equivalent scores 
j^QK^I^ • of a hlgh-achleviTig group of stu- 
* dents with a low-achieving group 
of students, without considering 
the corresponding percentile 
scores, can lead to Incorrect 
conclusions. 



Harvln Howard, a second gsrar^er. Is given Level 2 of a standardized 
test. He scores an 8.4 grade equivalent on the math test. The 
counselor at Marvin's school tells Mrs. Howard that •'Marvin did as 
well In this test as did the typical 8th grade student who took the 
test In the fourth nonth of the school year.*' 

I4 tliVLt anything ujfiong thJU 6tatvnavtt 

vmen the publishers of this test tried out their Level 2 test, they gave 
It to 2nd, 3rd, and 4th graders. They did not give the test to 8th gra- 
ders. Using the scores from the 2nd, 3rd, and 4th graders, they estima- 
ted what score an 8th grader would achieve. Therefore, all that can 
truthfully be said about Marvin's score of 8.4 Is that "If a typical 8th 
grader had been given this Level 2 test during the fourth month of the 
school year. It Is a good guess to say that he would have made about the 
sane score as did Harvln." 

MORA I • ^" ord^T to interpret grade equivalent scores correctly, \j 
iiw^mxrKMa • one must be aware of how they are derived and be justly wary of extreme 
projections such as the one in this Illustration. 
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ATTACHMENT B 

GRADE EQUIVALENT TRAINING 
MATERIALS 



UNDERSTANDING GRADE EQUIVALENTS AND PERCENTILES 

TYPE OF score: Grade Equivalents 

DEFINITION: ^ grade equivalent represents the grade placement (year 

and month) for which a given raw score Is average (the median) . 

QUESTIONS ADDRESSED: ^' close to grade level Is a student achieving? 

2. Did a student learn as much from one year to the next 
as did the average student? 



TYPE OF score: 

definition: 
questions addressed: 



Percentile 

A percentile represents a student's rank — the percentage of 
students which scored lower than a student. 

1. What proportion ot students did a student score higher 
or lower than? 

2. Did a student learn as much from one year to the next 
as other students who were ac the same percentile rank 
the first year? 



(Match the at the top and bottom of this page with the on the next page.) 

To the left Is a scale which represent the full range of possible grade equivalent 
scores for the ITBS Vocabulary Test, Levels S-14. 

This part of the scale represents the sixth grade: 6.0-6.9. The ten decimal places 
are equivalent to one month each. The three summer months together equal one month, 
and the nine months of the school year equal one month each. 

Over a 12-month period » how much does this grade equivalent scale go up? 

What grade equivalent represents the national average for the tlo'A that AISD tests 
students In grade 8? 

What grade equivalent represents the national average for the time that AISD tests 
students In grade 3? 

(M«fttch the ^1 at the top and bottom of this page with the on the next page.) 

The percentiles shown to the left are for the months of testing In AISD (April, 
grades K-6; February, grades 7 and 8). 

What percentile always corresponds to the grade equivalent for the month of testing? 

What grads equivalent gain Is made by a student who scores In both kindergarten and 
first grade at the 50th percentile? 

What grade equivalent gain Is made by a student who scores In both kindergarten 
and first grade at the 25th percentile? 

What grade equivalent gain Is made by a student who scores In both kindergarten 
and li ^'Tade at the 99th percentile? 

CAUTION: .rsde equivalents do not necessarily represent the best Instructional level 
:or a student. A fourth grader scoring at 7.8 should not necessarily be 
^'Orklng In seventh-grade terts. Although this fourth grader knows fourth- 
gr&de material as well as the average seventh grader, this ututi^nt riay not 
know sevent'i-grade material as well as seventh graders do. 
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Month of 
Testing 

In 



Parcehtilas Associated with Grada Equivalents 



■ ttr 

■ »• 
ii» 
11 « 
ttj 

tti 

119 

II ■ 
n ■ 



II « — 

II 3 — 
ni — 



no 

!&• 

- «■ 
•Of 

ie» 
ie« 

lOt 
tOl 
109 



It 



• I 



SI 
SI 



S3 



What Is the range of grade equ'lva- 
lents between the 1st and the 99th 
percentile In kindergarten? 

... la grade 8? 



The table below shows the gr&de 
equivalent gain needed to maintain 
the sane percentile rank from K 



PI 



through 8. 








Percentile 






GE 


_ _P«jk 


K GE 


8th GE 


Gain 


1 


P.4 


3.5 


4,i 


25 


K.3 


7.0 


6,7 


50 


K.8 


3.6 


7.8 


75 


1.4 


10.0 


8.6 


99 


2.6 


13.2 


10.6 



What percentile goes with a grade 
equivalent of 3.8 In grade 1? 



.. . In grade 2? 

In grade 3? 

In grade 4? 
... In grade 8? 



Which type of score can be Inter- 
preted with fewer cautions? 

Percentiles 



Grade Equivalents 
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GRADE LEVEL ►K 1 2 3 4 & 6 7 8 
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9.4 

9.3 

9.2 

9.0 

8.9 

8.8 

8.7 

8.6 

8.5 

8.4 

8.3 

8.2* 

8.1 

8.0 

7.9 

7.8 

7.7 

7.6 

7.5* 

7.4 

7.3 

7.2 

7.1 

7.0 

6.9 

6.8 

6.7 

6.6 

6.5 

6.4 

6.3 

6.2 

6.1 

6.0 

5.9 

5.8 

5.7 

5.6 

5.5 

5.4 

5.3 

5.2 

5.1 

5.0 

4.9 

4.8 

4.7 

4.6 

4.5 

4.4 

4.3 

4.2 

4.1 

4.0 

3.9 

3.8 

3.7 

3.6 

3.5 

3.4 

3.3 

3.2 

3.1 

3.0 

2.9 

2.8 

2.7 

2.6 



PERCENTILE 
Language Hath 
Total Total 
99 



99 



DOES A STUDENT WHO MAKES THE SAME GRADE 
EQUIVALEKT IN TVO AREAS ALSO MAKZ THE 
SAME PERCENTILE IN BOTH? 



I 

•90 



•90 



75 



In this example. Math Total and 
Language Total are compared for fifth 
gradera taking the ITBS In April. 



75 



50 



50 



25 



25 



For a atudent who scorea at the 90th 
percentile In both language and math, 
what are this student's grade equl* 
valenta? 

90th Zlle - CE for Language Total 

90th Zlle - CE for Math Total 



10 



10 




21 



B-3 



AUSTIN INDEPENDENT SCHOOL DISTRICT 
Office of Research and Evaluation 



Normlng a Standardized Test 



STANDARDIZED^ NORM-REFERENCED TEST: a test with— 

1. standardized administration and scoring procedures, and 

2. national norms. 



NATIONAL NORMS: scales on which we can compare the scores mada by our 

students to the scores ^lade by students across the nation. 

NATIONAL NORM GROUP: the group of students who took the test when it was 

normec]. 



RAW SCORE: the number of items answered correctly. 




RAW SCORE 12 13 ^ 15 16 17 18 19 20 



Figure 1. Raw scores made by out national norm group. 

TEST: Vocabulary 

NUMBER OF ITEMS: 20 

NUMBER OF STUDENTS: 25 

TIME OF TESTING (NORKING) : April of Grade 5 
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MEDIAN: score which divides the students into two equal parts. (Always 

the 50th percentile in the national norm group) 



PERCENTILE RANK! percentage of students scoring below a raw score point. 




RAW SCORE 12 13 14 15 16 17 18 19 20 
PERCENTILE _________ 



Figure 2. Percentiles Associated with Each Raw Score. 
Which raw score represents the median? 
Which raw score represent . the 50th percentile? 
What are the percentiles fo'r the other rav scores? 
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GRADE EQUIVALENT: the grade and month for which a particular raw score Is 

the median. 

April of Grade 5 = 5.8 

April of Grade 7 » 



February of .Grade 2 » 




RAW SCORE 12 13 m 15 16 17 18 19 20 
GRADE EQUIVALENT 



Figure 3. Grade Equivalents Associated with Each Raw Score. 

What grade equivalent goes with a raw score of 16? 

What grade equivalent goes with the other raw scores below? 



RESULTS OF TESTING NATIONAL NORM GROUPS AT TACH fiRnHF 




Median 


Grade 




Raw Score 


Equivalent 


Grade 


In April 


assifcned 


3 


12 




4 


14 




5 


16 




6 


18 




7 


20 





TVhat grade equivalents go with the other raw scores? 
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FOR THE MOlft^ ADVANCED READER 



Figure 4 adds on iseveral other terms which ace sometimes encountered. 

NORMAL CURVE 5 ® distribution of scores that is shaped about the same as our 
distribution, lliis bell-shaped curve represents how skills 
such as vocabulary are distributed — most people being average 
with fewer and fewer people being at each point as ve go 
hi^er or lower in vocabulary skill* 

STANINESS ^ scale divided into nine equal parts. Notice that with our normal 
curve there are many more persons in the middle stanines and very 
few in the hl^ and low stanines. Stanines and percentiles are 
always related in the same way. For example » the third stanlne 
always includes persons scoring from the 11th to the 22nd percentile. 

NORMAL CURVE EQUIVALENTS (NCE's): roughly equal to stanines broken down 

into ten parts each. Stanines and NCE v 
are useful because they represent equal- 
interval scales and may be averaged. 
NCE's are sometimes used with test scores 
for the Title I Program. 



PERCENTAGE OF ITEMS CORRECT: the raw score divided by the total number of 

items on a test (multiplied by 100 to remove 
the decimal point). This is often confused 
with percentile. 




RAW SCORE 


0-12 


13 




15 


16 


17 


18 


19 


20 


PERCENTILE 


1 2 


« 8 


11 18 


23 32 


40 50 


60 68 


77 82 


89 92 


96 98 99 


STAINE 


1 


2 


3 


M 


5 


6 


7 


8 


9 


NCE 


10 


20 


30 


i\0 


50 


60 


70 


80 


90 


Z ITEMS CORRECT 


0-60 


65 


70 


75 


80 


85 


90 


95 


100 


GRADE EQUIVALENT 


3.8 


^1.3 


^1.8 


5.3 


5.8 


6.3 


6.8 


7.3 


7.8 



Figure A, Noras associated with each raw score. 
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GE TRAINING 
POSTTEST 
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Interpretation of Grarle Equivalents and Percentile Scores 



In grade equivalents, how much do the 3 months of summer count? 

a, 3 months 
/b. 1 month 
Cm 0 month 
nothing 



In which area Is a third-grade student farther behind more of his peers if 
his gride equivalent scores are 2.8 In rerdlng and 2.9 in math? 

a. Reading 
/b. hath 
c. Not known from this Information 



If a second grader gains in one year from a GE of 4.8 to a GE of 5.8, her 
percentile scores will most likely - 

a. go up 

b. stay the same 
v/c. go down 

d. not known from this information 



Which most likely represents the greatest g^ln in grade equivalents over a 
one-year period? 

a. 3.8 GE to 4.8 GE 

b. 20th %ile to 21st Xlle 
^c. 90th %lle to 91st Zile 

d. Not known from this Information 



(More on back) 




Would you use grade equivalents or percentiles to answer each of these 
questions? Think in terms of a low-achieving student* 

How close to grade level is this student achieving? 

^a. Grade equivalents 
Percentiles 
c. Both 



Did this student learn as much last year as did an average student? 

/a. Grade equivalents 
b« Percentiles 
c. Both 



What proportion of students did this student score higher than? 

a. Grade equivalents 
%/b. Percentiles 
Co Both 



Did this student learn as much last year as did other students who were 
at the same low level? 

a. Grade equivalents 
i/b. Percentiles 
c. Both 
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